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( TARGET WORD INFORMATION 
^ EXTRACTION PROCESSINGS 



OBTAIN EACH ENGLISH WORD FROM ENGLISH ARTICLE AND 
OBTAIN TARGET WORD CANDIDATES FROM ENGLISH-JAPANESE 
TRANSLATION DICTIONARY 



OBTAIN EACH JAPANESE WORD FROM JAPANESE ARTICLE AND 
OBTAIN TARGET WORD CANDIDATES FROM JAPANESE-ENGLISH 
TRANSLATION DICTIONARY 



- S51 
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SELECT TARGET WORD CANDIDATES APPEARED IN THE 
JAPANESE ARTICLE FROM THE OBTAINED TARGET WORD 
CANDIDATES OF EACH ENGLISH WORD 



— S53 



REGARD ONE TARGET WORD CANDIDATE OF ENGLISH WORD 
Em MOST APPEARED IN THE JAPANESE ARTICLE AS THE 
TARGET WORD (Jm) OF THE ENGLISH WORD Em, AND CREATE 
A WORD PAIR (Em.Jm.Hm), Hm : APPEARANCE FREQUENCY 



SELECT TARGET WORD CANDIDATES APPEARED IN THE ENGLISH 
ARTICLE FROM THE OBTAINED TARGET WORD CANDIDATES 
OF EACH JAPANESE WORD 



REGARD ONE TARGET WORD CANDIDATE OF JAPANESE WORD 
Jn MOST APPEARED IN THE ENGLISH ARTICLE AS THE 
TARGET WORD (En) OF THE JAPANESE WORD Jn ,AND CREATE 
A WO RD PAIR (En.Jn.Hn), Hn : APPEARANCE FREQUENCY 

I 



MERGE TWO WORD PAIRS (Em,Jm,Hm) AND (En,Jn,Hn) 
IN ONE WORD PAIR (Em, Jm, Hm + Hn) IN CASE OF 
"Em = = En, Jm = = Jn", AND NOT MERGE THE TWO WORD 
PAIRS OTHERWISE 



SELECT ONE WORD PAIR INCLUDING ONE ENGLISH WORD AND 
MAXIMUM FREQUENCY IF A PLURALITY OF JAPANESE WORDS 
EXIST FOR THE ONE ENGLISH WORD, AND DELETE OTHER 
WORD PAIRS FOR THE ONE ENGLISH WORD 



OUTPUT EACH WORD PAIR AS THE TARGET WORD 
INFORMATION OF EACH ENGLISH WORD 



( END ) 
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PHRASE ALIGNMENT PROCESSING ] 



\ 




MORPHOLOGICALLY ANALYZE A HEADLINE AND 
EXTRACT A NOUN PHRASE FROM THE HEADLINE 


\ 


/ 


MORPHOLOGICALLY ANALYZE AN ARTICLE BODY 
(HEAD SENTENCE) AND EXTRACT NOUN PHRASE 
CANDIDATES FROM THE ARTICLE BODY (HEAD SENTENCE) 


\ 


/ 


COUNT A NUMBER OF COINCIDENT WORDS BETWEEN 

THE NOUN PHRASE OF THE HEADLINE AND 

THE NOUN PHRASE CANDIDATE OF THE ARTICLE BODY 


v 


/ 


CALCULATE A COINCIDENCE RATIO (= A NUMBER OF 
COINCIDENT WORDS/A NUMBER OF ALL WORDS IN 
THE NOUN PHASE CANDIDATE), AND EXTRACT THE 
NOUN PHRASE CANDIDATE OF WHICH THE COINCIDENCE 
DEGREE IS ABOVE A THRESHOLD 



JZ 



DELETE A PAIR OF CORRESPONDED NOUN PHRASES 
IF THE NOUN PHRASE OF THE ARTICLE BODY IS A 
SUBSET OF THE NOUN PHRASE OF THE HEADLINE 



SELECT ONE NOUN PHRASE OF WHICH THE 
COINCIDENCE DEGREE IS THE HIGHEST FROM A 
PLURALITY OF NOUN PHRASES OF THE ARTICLE BODY 
IF THE PLURALITY OF NOUN PHRASES CORRESPOND 
TO THE NOUN PHRASE OF THE HEADLINE 
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OUTPUT EACH PAIR OF CORRESPONDED NOUN PHRASES 
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( END ) 



FIG.7 



OBLON, SPIVAK, ET AL 
DOCKET #: 218475US2TTCRD 
INV: Kenji ONO 
SHEET J0_OF_20_ 



/ ABBREVIATION \ 

. ESTIMATION 
V PROCESSING J 



DIVIDE A NOUN PHRASE OF THE HEADLINE 
INTO EACH WORD, DIVIDE THE NOUN PHRASE 
AT SPACE OR HYPHEN IF THE NOUN PHRASE 
INCLUDES SPACE OR HYPHEN, AND 
REGARDS THESE WORDS AS WORD 
SEQUENCE A 
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DIVIDE A NOUN PHRASE OF THE ARTICLE 
BODY (HEAD SENTENCE) INTO EACH WORD, 
DIVIDE THE NOUN PHRASE AT SPACE OR 
HYPHEN IF THE NOUN PHRASE INCLUDES 
SPACE OR HYPHEN, AND REGARDS THESE 
WORDS AS WORD SEQUENCE B 
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ENGLISH WORD 
"CONSISTED OF CAPITAL LETTERS^ 
INCLUDED IN THE WORD 
SEQUENCE A 



YES 



ADD THE ENGLISH WORD TO 
ABBREVIATION CANDIDATE ALIGNMENT 
RA 



1S\. S75 
„ "WORD SERIES" „ 
WHICH IS CONNECTIVELY ^ 
ONSISTED OF EACH WORD INCLUDING^ 
CAPITAL LETTER AT HEAD POSITION 
^INCLUDED IN THE WORD^ 
^SEQUENCE B. 
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YES 



CREATE CHARACTER SERIES BY 
CONNECTING CAPITAL LETTER OF 
EACH WORD, AND ADD THE CHARACTER 
SERIES TO ABBREVIATION CANDIDATE 
ALIGNMENT RB 



COUNT A NUMBER OF COINCIDENT 
WORDS BETWEEN THE WORD SEQUENCE 
A AND THE WORD SEQUENCE B 



COUNT THE SAME ENTRY BETWEEN THE 
ABBREVIATION CANDIDATE ALIGNMENTS RA 
AND RB, AND ADD THE COUNTED VALUE TO 
THE NUMBER O F COINCIDENT WORDS 
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( END ) 
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/information source N 

^DETECTION PROCESSING J 



m 
m 
o 

w 



. \ 

EXTRACT A NOUN PHRASE 
FROM A HEADLINE BY REFEI 
ANALYSIS RESULT OF THE H 
THE NOUN PHRASE AS A NC 


T THE END OF SENTENCE 
1RING TO MORPHOLOGICAL 
EADLINE, AND REGARDS 
UN PHRASE A 


\ 


/ 
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RETRIEVE A PATTERN "NOUN PHRASE + (report | say | tell)" 
FROM A HEAD SENTENCE BY REFERRING TO 
MORPHOLOGICAL ANALYSIS RESULT OF THE HEAD 
SENTENCE.AND REGARDS THE PATTERN AS A NOUN 
PHRASE B IF THE PATTERN IS RETRIEVED 
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( END ) 
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ENGLISH-JAPANESE PARALLEL CORPUS 



(ENGLISH ARTICLE 1) 



ENGLISH 
WORD 


JAPANESE 
WORD 















(ENGLISH ARTICLE N) 



ENGLISH 
WORD 


JAPANESE 
WORD 
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